Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 8998 |
| Missing cells | 552 |
| Missing cells (%) | 0.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 128.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 4 |
| BOOL | 1 |
income is highly correlated with age | High correlation |
age is highly correlated with income | High correlation |
mnt is highly correlated with frq | High correlation |
frq is highly correlated with mnt | High correlation |
dependents has 282 (3.1%) missing values | Missing |
status has 177 (2.0%) missing values | Missing |
kitchen has 833 (9.3%) zeros | Zeros |
toys has 815 (9.1%) zeros | Zeros |
house_keeping has 851 (9.5%) zeros | Zeros |
Reproduction
| Analysis started | 2020-10-12 08:45:42.683345 |
|---|---|
| Analysis finished | 2020-10-12 08:46:11.281520 |
| Duration | 28.6 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1966.05968 |
|---|---|
| Minimum | 1936 |
| Maximum | 1996 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 1936 |
|---|---|
| 5-th percentile | 1939 |
| Q1 | 1951 |
| median | 1966 |
| Q3 | 1981 |
| 95-th percentile | 1993 |
| Maximum | 1996 |
| Range | 60 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 17.29655221 |
|---|---|
| Coefficient of variation (CV) | 0.008797572313 |
| Kurtosis | -1.195990117 |
| Mean | 1966.05968 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.007954084276 |
| Sum | 17690605 |
| Variance | 299.1707182 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1974 | 171 | 1.9% | |
| 1951 | 169 | 1.9% | |
| 1992 | 168 | 1.9% | |
| 1979 | 167 | 1.9% | |
| 1976 | 166 | 1.8% | |
| 1961 | 165 | 1.8% | |
| 1960 | 164 | 1.8% | |
| 1949 | 163 | 1.8% | |
| 1959 | 162 | 1.8% | |
| 1978 | 160 | 1.8% | |
| Other values (51) | 7343 | 81.6% |
| Value | Count | Frequency (%) | |
| 1936 | 75 | 0.8% | |
| 1937 | 140 | 1.6% | |
| 1938 | 146 | 1.6% | |
| 1939 | 137 | 1.5% | |
| 1940 | 153 | 1.7% |
| Value | Count | Frequency (%) | |
| 1996 | 81 | 0.9% | |
| 1995 | 147 | 1.6% | |
| 1994 | 159 | 1.8% | |
| 1993 | 157 | 1.7% | |
| 1992 | 168 | 1.9% |
| Distinct | 8524 |
|---|---|
| Distinct (%) | 95.2% |
| Missing | 46 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69963.55083 |
|---|---|
| Minimum | 10000 |
| Maximum | 140628 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 26314.6 |
| Q1 | 47741 |
| median | 70030.5 |
| Q3 | 92218 |
| 95-th percentile | 113395.3 |
| Maximum | 140628 |
| Range | 130628 |
| Interquartile range (IQR) | 44477 |
Descriptive statistics
| Standard deviation | 27591.55623 |
|---|---|
| Coefficient of variation (CV) | 0.3943704386 |
| Kurtosis | -0.9293280359 |
| Mean | 69963.55083 |
| Median Absolute Deviation (MAD) | 22214.5 |
| Skewness | 0.008688890946 |
| Sum | 626313707 |
| Variance | 761293975 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 10000 | 35 | 0.4% | |
| 64185 | 4 | < 0.1% | |
| 83455 | 3 | < 0.1% | |
| 33542 | 3 | < 0.1% | |
| 99452 | 3 | < 0.1% | |
| 66184 | 3 | < 0.1% | |
| 37902 | 3 | < 0.1% | |
| 39782 | 3 | < 0.1% | |
| 51743 | 3 | < 0.1% | |
| 49948 | 3 | < 0.1% | |
| Other values (8514) | 8889 | 98.8% | |
| (Missing) | 46 | 0.5% |
| Value | Count | Frequency (%) | |
| 10000 | 35 | 0.4% | |
| 10182 | 1 | < 0.1% | |
| 10186 | 1 | < 0.1% | |
| 10608 | 1 | < 0.1% | |
| 10886 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 140628 | 1 | < 0.1% | |
| 137338 | 1 | < 0.1% | |
| 137053 | 1 | < 0.1% | |
| 136922 | 1 | < 0.1% | |
| 136213 | 1 | < 0.1% |
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.84807735 |
|---|---|
| Minimum | 3 |
| Maximum | 59 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 10 |
| median | 17 |
| Q3 | 28 |
| 95-th percentile | 40 |
| Maximum | 59 |
| Range | 56 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 10.90343461 |
|---|---|
| Coefficient of variation (CV) | 0.549344625 |
| Kurtosis | -0.4139889171 |
| Mean | 19.84807735 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.6977790699 |
| Sum | 178593 |
| Variance | 118.8848863 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 10 | 635 | 7.1% | |
| 9 | 583 | 6.5% | |
| 11 | 513 | 5.7% | |
| 8 | 493 | 5.5% | |
| 12 | 418 | 4.6% | |
| 7 | 325 | 3.6% | |
| 13 | 316 | 3.5% | |
| 14 | 282 | 3.1% | |
| 21 | 238 | 2.6% | |
| 25 | 233 | 2.6% | |
| Other values (47) | 4962 | 55.1% |
| Value | Count | Frequency (%) | |
| 3 | 5 | 0.1% | |
| 4 | 24 | 0.3% | |
| 5 | 87 | 1.0% | |
| 6 | 173 | 1.9% | |
| 7 | 325 | 3.6% |
| Value | Count | Frequency (%) | |
| 59 | 2 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 57 | 1 | < 0.1% | |
| 56 | 3 | < 0.1% | |
| 55 | 3 | < 0.1% |
rcn
Real number (ℝ≥0)
| Distinct | 378 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.46977106 |
|---|---|
| Minimum | 0 |
| Maximum | 549 |
| Zeros | 44 |
| Zeros (%) | 0.5% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 26 |
| median | 53 |
| Q3 | 79 |
| 95-th percentile | 99 |
| Maximum | 549 |
| Range | 549 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 69.76180219 |
|---|---|
| Coefficient of variation (CV) | 1.116728956 |
| Kurtosis | 21.09692287 |
| Mean | 62.46977106 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 4.174006567 |
| Sum | 562103 |
| Variance | 4866.709045 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 9 | 107 | 1.2% | |
| 56 | 105 | 1.2% | |
| 64 | 103 | 1.1% | |
| 4 | 102 | 1.1% | |
| 29 | 102 | 1.1% | |
| 27 | 100 | 1.1% | |
| 92 | 100 | 1.1% | |
| 54 | 99 | 1.1% | |
| 68 | 99 | 1.1% | |
| 17 | 98 | 1.1% | |
| Other values (368) | 7983 | 88.7% |
| Value | Count | Frequency (%) | |
| 0 | 44 | 0.5% | |
| 1 | 91 | 1.0% | |
| 2 | 92 | 1.0% | |
| 3 | 91 | 1.0% | |
| 4 | 102 | 1.1% |
| Value | Count | Frequency (%) | |
| 549 | 3 | < 0.1% | |
| 547 | 1 | < 0.1% | |
| 546 | 3 | < 0.1% | |
| 542 | 2 | < 0.1% | |
| 540 | 1 | < 0.1% |
| Distinct | 717 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 622.162814 |
|---|---|
| Minimum | 6 |
| Maximum | 3052 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 63 |
| median | 383 |
| Q3 | 1076 |
| 95-th percentile | 1917.15 |
| Maximum | 3052 |
| Range | 3046 |
| Interquartile range (IQR) | 1013 |
Descriptive statistics
| Standard deviation | 646.7682046 |
|---|---|
| Coefficient of variation (CV) | 1.039548154 |
| Kurtosis | -0.05809376933 |
| Mean | 622.162814 |
| Median Absolute Deviation (MAD) | 343 |
| Skewness | 0.9809806035 |
| Sum | 5598221 |
| Variance | 418309.1104 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 19 | 158 | 1.8% | |
| 41 | 121 | 1.3% | |
| 20 | 108 | 1.2% | |
| 40 | 89 | 1.0% | |
| 42 | 88 | 1.0% | |
| 64 | 86 | 1.0% | |
| 66 | 78 | 0.9% | |
| 65 | 76 | 0.8% | |
| 92 | 61 | 0.7% | |
| 118 | 56 | 0.6% | |
| Other values (707) | 8077 | 89.8% |
| Value | Count | Frequency (%) | |
| 6 | 1 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 8 | 8 | 0.1% | |
| 9 | 14 | 0.2% | |
| 10 | 22 | 0.2% |
| Value | Count | Frequency (%) | |
| 3052 | 1 | < 0.1% | |
| 2938 | 1 | < 0.1% | |
| 2936 | 1 | < 0.1% | |
| 2878 | 1 | < 0.1% | |
| 2823 | 1 | < 0.1% |
clothes
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.44665481 |
|---|---|
| Minimum | 1 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 33 |
| median | 51 |
| Q3 | 69 |
| 95-th percentile | 88 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 23.42224892 |
|---|---|
| Coefficient of variation (CV) | 0.4642973653 |
| Kurtosis | -0.9185954232 |
| Mean | 50.44665481 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.07821931254 |
| Sum | 453919 |
| Variance | 548.6017444 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 55 | 150 | 1.7% | |
| 40 | 141 | 1.6% | |
| 41 | 139 | 1.5% | |
| 47 | 139 | 1.5% | |
| 70 | 137 | 1.5% | |
| 56 | 136 | 1.5% | |
| 58 | 136 | 1.5% | |
| 46 | 135 | 1.5% | |
| 31 | 133 | 1.5% | |
| 57 | 133 | 1.5% | |
| Other values (89) | 7619 | 84.7% |
| Value | Count | Frequency (%) | |
| 1 | 2 | < 0.1% | |
| 2 | 15 | 0.2% | |
| 3 | 24 | 0.3% | |
| 4 | 38 | 0.4% | |
| 5 | 44 | 0.5% |
| Value | Count | Frequency (%) | |
| 99 | 1 | < 0.1% | |
| 98 | 1 | < 0.1% | |
| 97 | 9 | 0.1% | |
| 96 | 17 | 0.2% | |
| 95 | 25 | 0.3% |
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.039675483 |
|---|---|
| Minimum | 0 |
| Maximum | 75 |
| Zeros | 833 |
| Zeros (%) | 9.3% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 10 |
| 95-th percentile | 23 |
| Maximum | 75 |
| Range | 75 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 7.84813931 |
|---|---|
| Coefficient of variation (CV) | 1.114843906 |
| Kurtosis | 5.619265964 |
| Mean | 7.039675483 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.049458185 |
| Sum | 63343 |
| Variance | 61.59329064 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 1326 | 14.7% | |
| 2 | 1005 | 11.2% | |
| 0 | 833 | 9.3% | |
| 3 | 761 | 8.5% | |
| 4 | 644 | 7.2% | |
| 5 | 584 | 6.5% | |
| 6 | 506 | 5.6% | |
| 7 | 404 | 4.5% | |
| 8 | 372 | 4.1% | |
| 10 | 289 | 3.2% | |
| Other values (48) | 2274 | 25.3% |
| Value | Count | Frequency (%) | |
| 0 | 833 | 9.3% | |
| 1 | 1326 | 14.7% | |
| 2 | 1005 | 11.2% | |
| 3 | 761 | 8.5% | |
| 4 | 644 | 7.2% |
| Value | Count | Frequency (%) | |
| 75 | 1 | < 0.1% | |
| 67 | 1 | < 0.1% | |
| 65 | 1 | < 0.1% | |
| 61 | 1 | < 0.1% | |
| 59 | 1 | < 0.1% |
small_appliances
Real number (ℝ≥0)
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.52411647 |
|---|---|
| Minimum | 1 |
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 19 |
| median | 28 |
| Q3 | 37 |
| 95-th percentile | 50 |
| Maximum | 74 |
| Range | 73 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 12.5864368 |
|---|---|
| Coefficient of variation (CV) | 0.4412559742 |
| Kurtosis | -0.4230030191 |
| Mean | 28.52411647 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.3146456491 |
| Sum | 256660 |
| Variance | 158.4183913 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 23 | 286 | 3.2% | |
| 22 | 277 | 3.1% | |
| 26 | 276 | 3.1% | |
| 27 | 269 | 3.0% | |
| 19 | 268 | 3.0% | |
| 25 | 264 | 2.9% | |
| 30 | 262 | 2.9% | |
| 28 | 261 | 2.9% | |
| 31 | 254 | 2.8% | |
| 29 | 236 | 2.6% | |
| Other values (63) | 6345 | 70.5% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 2 | < 0.1% | |
| 3 | 13 | 0.1% | |
| 4 | 39 | 0.4% | |
| 5 | 53 | 0.6% |
| Value | Count | Frequency (%) | |
| 74 | 2 | < 0.1% | |
| 73 | 1 | < 0.1% | |
| 72 | 1 | < 0.1% | |
| 70 | 1 | < 0.1% | |
| 69 | 3 | < 0.1% |
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.036897088 |
|---|---|
| Minimum | 0 |
| Maximum | 62 |
| Zeros | 815 |
| Zeros (%) | 9.1% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 10 |
| 95-th percentile | 23 |
| Maximum | 62 |
| Range | 62 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 7.924421742 |
|---|---|
| Coefficient of variation (CV) | 1.126124433 |
| Kurtosis | 5.644657211 |
| Mean | 7.036897088 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.096047402 |
| Sum | 63318 |
| Variance | 62.79645995 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 1370 | 15.2% | |
| 2 | 988 | 11.0% | |
| 0 | 815 | 9.1% | |
| 3 | 779 | 8.7% | |
| 4 | 675 | 7.5% | |
| 5 | 542 | 6.0% | |
| 6 | 499 | 5.5% | |
| 7 | 409 | 4.5% | |
| 8 | 344 | 3.8% | |
| 9 | 295 | 3.3% | |
| Other values (48) | 2282 | 25.4% |
| Value | Count | Frequency (%) | |
| 0 | 815 | 9.1% | |
| 1 | 1370 | 15.2% | |
| 2 | 988 | 11.0% | |
| 3 | 779 | 8.7% | |
| 4 | 675 | 7.5% |
| Value | Count | Frequency (%) | |
| 62 | 1 | < 0.1% | |
| 61 | 1 | < 0.1% | |
| 60 | 2 | < 0.1% | |
| 57 | 1 | < 0.1% | |
| 56 | 1 | < 0.1% |
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.929984441 |
|---|---|
| Minimum | 0 |
| Maximum | 77 |
| Zeros | 851 |
| Zeros (%) | 9.5% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 9 |
| 95-th percentile | 23 |
| Maximum | 77 |
| Range | 77 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 7.882655355 |
|---|---|
| Coefficient of variation (CV) | 1.137470859 |
| Kurtosis | 6.885521741 |
| Mean | 6.929984441 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.229124081 |
| Sum | 62356 |
| Variance | 62.13625544 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 1326 | 14.7% | |
| 2 | 981 | 10.9% | |
| 0 | 851 | 9.5% | |
| 3 | 848 | 9.4% | |
| 4 | 675 | 7.5% | |
| 5 | 519 | 5.8% | |
| 6 | 477 | 5.3% | |
| 7 | 446 | 5.0% | |
| 8 | 357 | 4.0% | |
| 9 | 309 | 3.4% | |
| Other values (49) | 2209 | 24.5% |
| Value | Count | Frequency (%) | |
| 0 | 851 | 9.5% | |
| 1 | 1326 | 14.7% | |
| 2 | 981 | 10.9% | |
| 3 | 848 | 9.4% | |
| 4 | 675 | 7.5% |
| Value | Count | Frequency (%) | |
| 77 | 1 | < 0.1% | |
| 72 | 1 | < 0.1% | |
| 62 | 1 | < 0.1% | |
| 59 | 1 | < 0.1% | |
| 58 | 2 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 282 |
| Missing (%) | 3.1% |
| Memory size | 70.3 KiB |
| 1 | |
|---|---|
| 0 | |
| (Missing) | 282 |
| Value | Count | Frequency (%) | |
| 1 | 6164 | 68.5% | |
| 0 | 2552 | 28.4% | |
| (Missing) | 282 | 3.1% |
per_net_purchase
Real number (ℝ≥0)
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.42898422 |
|---|---|
| Minimum | 4 |
| Maximum | 88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.3 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 28 |
| median | 45 |
| Q3 | 57 |
| 95-th percentile | 69 |
| Maximum | 88 |
| Range | 84 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 18.49574245 |
|---|---|
| Coefficient of variation (CV) | 0.4359223486 |
| Kurtosis | -1.03466056 |
| Mean | 42.42898422 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.2664532226 |
| Sum | 381776 |
| Variance | 342.0924887 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 56 | 215 | 2.4% | |
| 54 | 214 | 2.4% | |
| 57 | 212 | 2.4% | |
| 55 | 199 | 2.2% | |
| 58 | 192 | 2.1% | |
| 61 | 192 | 2.1% | |
| 53 | 189 | 2.1% | |
| 13 | 188 | 2.1% | |
| 60 | 185 | 2.1% | |
| 59 | 184 | 2.0% | |
| Other values (72) | 7028 | 78.1% |
| Value | Count | Frequency (%) | |
| 4 | 1 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 6 | 15 | 0.2% | |
| 7 | 35 | 0.4% | |
| 8 | 54 | 0.6% |
| Value | Count | Frequency (%) | |
| 88 | 1 | < 0.1% | |
| 84 | 1 | < 0.1% | |
| 83 | 1 | < 0.1% | |
| 82 | 3 | < 0.1% | |
| 81 | 3 | < 0.1% |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
| M | |
|---|---|
| F |
| Value | Count | Frequency (%) | |
| M | 5784 | 64.3% | |
| F | 3214 | 35.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
education
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 47 |
| Missing (%) | 0.5% |
| Memory size | 70.3 KiB |
| Graduation | |
|---|---|
| 2nd Cycle | |
| Master | |
| 1st Cycle | |
| PhD |
| Value | Count | Frequency (%) | |
| Graduation | 4429 | 49.2% | |
| 2nd Cycle | 1496 | 16.6% | |
| Master | 1303 | 14.5% | |
| 1st Cycle | 1104 | 12.3% | |
| PhD | 593 | 6.6% | |
| OldSchool | 26 | 0.3% | |
| (Missing) | 47 | 0.5% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.631029118 |
| Min length | 3 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 177 |
| Missing (%) | 2.0% |
| Memory size | 70.3 KiB |
| Married | |
|---|---|
| Single | |
| Together | |
| Divorced | |
| Widow |
| Value | Count | Frequency (%) | |
| Married | 3273 | 36.4% | |
| Single | 2293 | 25.5% | |
| Together | 2118 | 23.5% | |
| Divorced | 677 | 7.5% | |
| Widow | 445 | 4.9% | |
| Whatever | 15 | 0.2% | |
| (Missing) | 177 | 2.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.879862192 |
| Min length | 3 |
description
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
| OK nice! | |
|---|---|
| Meh... | |
| Kind of OK | |
| Take my money!! | |
| Horrible | 41 |
| Value | Count | Frequency (%) | |
| OK nice! | 3434 | 38.2% | |
| Meh... | 2107 | 23.4% | |
| Kind of OK | 2090 | 23.2% | |
| Take my money!! | 1326 | 14.7% | |
| Horrible | 41 | 0.5% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 9.027783952 |
| Min length | 6 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
First rows
| age | income | frq | rcn | mnt | clothes | kitchen | small_appliances | toys | house_keeping | dependents | per_net_purchase | gender | education | status | description | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1946 | 90782.0 | 33 | 66 | 1402 | 37 | 5 | 44 | 10 | 3 | 0.0 | 19 | M | Graduation | Together | Take my money!! |
| 1 | 1936 | 113023.0 | 32 | 6 | 1537 | 55 | 1 | 38 | 4 | 2 | 0.0 | 9 | F | PhD | Divorced | Take my money!! |
| 2 | 1990 | 28344.0 | 11 | 69 | 44 | 32 | 19 | 24 | 1 | 24 | 1.0 | 59 | M | Graduation | Married | Kind of OK |
| 3 | 1955 | 93571.0 | 26 | 10 | 888 | 60 | 10 | 19 | 6 | 5 | 1.0 | 35 | F | Master | NaN | OK nice! |
| 4 | 1955 | 91852.0 | 31 | 26 | 1138 | 59 | 5 | 28 | 4 | 4 | 1.0 | 34 | F | Graduation | Together | Take my money!! |
| 5 | 1982 | 22386.0 | 14 | 65 | 56 | 47 | 2 | 48 | 2 | 1 | 1.0 | 67 | M | PhD | Single | OK nice! |
| 6 | 1969 | 69485.0 | 18 | 73 | 345 | 71 | 7 | 13 | 1 | 8 | 1.0 | 46 | M | Graduation | Together | OK nice! |
| 7 | 1960 | 68602.0 | 5 | 44 | 41 | 84 | 1 | 12 | 2 | 0 | 1.0 | 37 | M | Graduation | Together | Horrible |
| 8 | 1940 | 109499.0 | 30 | 75 | 1401 | 38 | 9 | 35 | 9 | 9 | 0.0 | 17 | M | Graduation | Divorced | OK nice! |
| 9 | 1994 | 23846.0 | 8 | 153 | 19 | 18 | 55 | 17 | 10 | 1 | 1.0 | 39 | F | 1st Cycle | Together | Meh... |
Last rows
| age | income | frq | rcn | mnt | clothes | kitchen | small_appliances | toys | house_keeping | dependents | per_net_purchase | gender | education | status | description | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8988 | 1947 | 100928.0 | 28 | 6 | 1152 | 74 | 3 | 20 | 1 | 2 | 0.0 | 29 | F | Master | Divorced | Take my money!! |
| 8989 | 1947 | 87605.0 | 21 | 18 | 823 | 34 | 21 | 9 | 35 | 1 | 0.0 | 9 | M | 1st Cycle | Widow | Kind of OK |
| 8990 | 1995 | 28144.0 | 10 | 41 | 46 | 11 | 40 | 24 | 22 | 2 | 1.0 | 59 | M | 1st Cycle | Married | OK nice! |
| 8991 | 1939 | 126254.0 | 46 | 36 | 2231 | 32 | 4 | 47 | 9 | 8 | 0.0 | 22 | M | Graduation | Divorced | Take my money!! |
| 8992 | 1954 | 87399.0 | 25 | 1 | 837 | 56 | 8 | 27 | 8 | 1 | NaN | 47 | M | Graduation | Married | Kind of OK |
| 8993 | 1960 | 94367.0 | 28 | 1 | 896 | 68 | 5 | 21 | 3 | 4 | 1.0 | 55 | F | 1st Cycle | Single | Take my money!! |
| 8994 | 1975 | 58121.0 | 12 | 6 | 61 | 53 | 6 | 28 | 7 | 6 | 1.0 | 71 | M | 2nd Cycle | Single | Meh... |
| 8995 | 1986 | 54292.0 | 29 | 72 | 1011 | 41 | 11 | 36 | 1 | 11 | 0.0 | 31 | M | Graduation | Together | Take my money!! |
| 8996 | 1938 | 125962.0 | 38 | 75 | 1668 | 61 | 2 | 25 | 5 | 6 | 1.0 | 45 | M | 2nd Cycle | Married | Take my money!! |
| 8997 | 1994 | 26385.0 | 9 | 24 | 46 | 5 | 13 | 21 | 46 | 15 | 1.0 | 52 | M | 1st Cycle | Single | Kind of OK |